45 research outputs found

    Folding of small proteins: A matter of geometry?

    Full text link
    We review some of our recent results obtained within the scope of simple lattice models and Monte Carlo simulations that illustrate the role of native geometry in the folding kinetics of two state folders.Comment: To appear in Molecular Physic

    MIRRAGGE – Minimum Information Required for Reproducible AGGregation Experiments

    Get PDF
    Reports on phase separation and amyloid formation for multiple proteins and aggregation-prone peptides are recurrently used to explore the molecular mechanisms associated with several human diseases. The information conveyed by these reports can be used directly in translational investigation, e.g., for the design of better drug screening strategies, or be compiled in databases for benchmarking novel aggregation-predicting algorithms. Given that minute protocol variations determine different outcomes of protein aggregation assays, there is a strong urge for standardized descriptions of the different types of aggregates and the detailed methods used in their production. In an attempt to address this need, we assembled the Minimum Information Required for Reproducible Aggregation Experiments (MIRRAGGE) guidelines, considering first-principles and the established literature on protein self-assembly and aggregation. This consensus information aims to cover the major and subtle determinants of experimental reproducibility while avoiding excessive technical details that are of limited practical interest for non-specialized users. The MIRRAGGE table (template available in Supplementary Information) is useful as a guide for the design of new studies and as a checklist during submission of experimental reports for publication. Full disclosure of relevant information also enables other researchers to reproduce results correctly and facilitates systematic data deposition into curated databases.This work was supported by (i) the European Regional Development Fund (ERDF) through the COMPETE 2020β€”Operacional Programme for Competitiveness and Internationalisation (POCI), Portugal 2020, and by Portuguese funds through FCTβ€”Fundação para a CiΓͺncia e a Tecnologia (FCT/MCTES) in the framework of grants POCI-01-0145-FEDER-031173, POCI-01-0145-FEDER-007274, POCI-01-0145-FEDER-031323 (β€œInstitute for Research and Innovation in Health Sciences”), UID/Multi/04046/2013 (BioISI) and PTDC/NEUNMC/2138/2014 (to CMG). SV was funded by the Spanish Ministry of Economy and Competitiveness (BIO2016-78310-R) and by ICREA (ICREA-Academia 2015). ZG and ZB were funded by Slovak research agentures VEGA 02/0145/17, 02/0030/18 and APVV-18-0284. RS was funded by VEGA 02/0163/19. DEO was funded by the Lundbeck Foundation (grant no. R276-2018-671) and the Independent Research Foundation Denmark | Natural Sciences (grant no. 8021-00208B). AP research was supported by UK Dementia Research Institute (RE1 3556) and by ARUK (ARUK-PG2019B-020)

    Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index

    Get PDF
    BACKGROUND: Knowledge of protein domain boundaries is critical for the characterisation and understanding of protein function. The ability to identify domains without the knowledge of the structure – by using sequence information only – is an essential step in many types of protein analyses. In this present study, we demonstrate that the performance of DomainDiscovery is improved significantly by including the inter-domain linker index value for domain identification from sequence-based information. Improved DomainDiscovery uses a Support Vector Machine (SVM) approach and a unique training dataset built on the principle of consensus among experts in defining domains in protein structure. The SVM was trained using a PSSM (Position Specific Scoring Matrix), secondary structure, solvent accessibility information and inter-domain linker index to detect possible domain boundaries for a target sequence. RESULTS: Improved DomainDiscovery is compared with other methods by benchmarking against a structurally non-redundant dataset and also CASP5 targets. Improved DomainDiscovery achieves 70% accuracy for domain boundary identification in multi-domains proteins. CONCLUSION: Improved DomainDiscovery compares favourably to the performance of other methods and excels in the identification of domain boundaries for multi-domain proteins as a result of introducing support vector machine with benchmark_2 dataset

    A microscale protein NMR sample screening pipeline

    Get PDF
    As part of efforts to develop improved methods for NMR protein sample preparation and structure determination, the Northeast Structural Genomics Consortium (NESG) has implemented an NMR screening pipeline for protein target selection, construct optimization, and buffer optimization, incorporating efficient microscale NMR screening of proteins using a micro-cryoprobe. The process is feasible because the newest generation probe requires only small amounts of protein, typically 30–200Β ΞΌg in 8–35Β ΞΌl volume. Extensive automation has been made possible by the combination of database tools, mechanization of key process steps, and the use of a micro-cryoprobe that gives excellent data while requiring little optimization and manual setup. In this perspective, we describe the overall process used by the NESG for screening NMR samples as part of a sample optimization process, assessing optimal construct design and solution conditions, as well as for determining protein rotational correlation times in order to assess protein oligomerization states. Database infrastructure has been developed to allow for flexible implementation of new screening protocols and harvesting of the resulting output. The NESG micro NMR screening pipeline has also been used for detergent screening of membrane proteins. Descriptions of the individual steps in the NESG NMR sample design, production, and screening pipeline are presented in the format of a standard operating procedure

    Improved general regression network for protein domain boundary prediction

    Get PDF
    Background: Protein domains present some of the most useful information that can be used to understand protein structure and functions. Recent research on protein domain boundary prediction has been mainly based on widely known machine learning techniques, such as Artificial Neural Networks and Support Vector Machines. In this study, we propose a new machine learning model (IGRN) that can achieve accurate and reliable classification, with significantly reduced computations. The IGRN was trained using a PSSM (Position Specific Scoring Matrix), secondary structure, solvent accessibility information and inter-domain linker index to detect possible domain boundaries for a target sequence. Results: The proposed model achieved average prediction accuracy of 67% on the Benchmark_2 dataset for domain boundary identification in multi-domains proteins and showed superior predictive performance and generalisation ability among the most widely used neural network models. With the CASP7 benchmark dataset, it also demonstrated comparable performance to existing domain boundary predictors such as DOMpro, DomPred, DomSSEA, DomCut and DomainDiscovery with 70.10% prediction accuracy. Conclusion: The performance of proposed model has been compared favourably to the performance of other existing machine learning based methods as well as widely known domain boundary predictors on two benchmark datasets and excels in the identification of domain boundaries in terms of model bias, generalisation and computational requirements. Β© 2008 Yoo et al; licensee BioMed Central Ltd

    Rules Governing Selective Protein Carbonylation

    Get PDF
    BACKGROUND:Carbonyl derivatives are mainly formed by direct metal-catalysed oxidation (MCO) attacks on the amino-acid side chains of proline, arginine, lysine and threonine residues. For reasons unknown, only some proteins are prone to carbonylation. METHODOLOGY/PRINCIPAL FINDINGS:we used mass spectrometry analysis to identify carbonylated sites in: BSA that had undergone in vitro MCO, and 23 carbonylated proteins in Escherichia coli. The presence of a carbonylated site rendered the neighbouring carbonylatable site more prone to carbonylation. Most carbonylated sites were present within hot spots of carbonylation. These observations led us to suggest rules for identifying sites more prone to carbonylation. We used these rules to design an in silico model (available at http://www.lcb.cnrs-mrs.fr/CSPD/), allowing an effective and accurate prediction of sites and of proteins more prone to carbonylation in the E. coli proteome. CONCLUSIONS/SIGNIFICANCE:We observed that proteins evolve to either selectively maintain or lose predicted hot spots of carbonylation depending on their biological function. As our predictive model also allows efficient detection of carbonylated proteins in Bacillus subtilis, we believe that our model may be extended to direct MCO attacks in all organisms

    Multiple Routes and Milestones in the Folding of HIV–1 Protease Monomer

    Get PDF
    Proteins fold on a time scale incompatible with a mechanism of random search in conformational space thus indicating that somehow they are guided to the native state through a funneled energetic landscape. At the same time the heterogeneous kinetics suggests the existence of several different folding routes. Here we propose a scenario for the folding mechanism of the monomer of HIV–1 protease in which multiple pathways and milestone events coexist. A variety of computational approaches supports this picture. These include very long all-atom molecular dynamics simulations in explicit solvent, an analysis of the network of clusters found in multiple high-temperature unfolding simulations and a complete characterization of free-energy surfaces carried out using a structure-based potential at atomistic resolution and a combination of metadynamics and parallel tempering. Our results confirm that the monomer in solution is stable toward unfolding and show that at least two unfolding pathways exist. In our scenario, the formation of a hydrophobic core is a milestone in the folding process which must occur along all the routes that lead this protein towards its native state. Furthermore, the ensemble of folding pathways proposed here substantiates a rational drug design strategy based on inhibiting the folding of HIV–1 protease

    An Evolutionary Trade-Off between Protein Turnover Rate and Protein Aggregation Favors a Higher Aggregation Propensity in Fast Degrading Proteins

    Get PDF
    We previously showed the existence of selective pressure against protein aggregation by the enrichment of aggregation-opposing β€˜gatekeeper’ residues at strategic places along the sequence of proteins. Here we analyzed the relationship between protein lifetime and protein aggregation by combining experimentally determined turnover rates, expression data, structural data and chaperone interaction data on a set of more than 500 proteins. We find that selective pressure on protein sequences against aggregation is not homogeneous but that short-living proteins on average have a higher aggregation propensity and fewer chaperone interactions than long-living proteins. We also find that short-living proteins are more often associated to deposition diseases. These findings suggest that the efficient degradation of high-turnover proteins is sufficient to preclude aggregation, but also that factors that inhibit proteasomal activity, such as physiological ageing, will primarily affect the aggregation of short-living proteins
    corecore